Extending The Coverage Of A Valency Dictionary
Abstract
Information on subcategorization and selectional restrictions is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. However, adding this detailed information to a valency dictionary is both time consuming and costly. In this paper we present a method of assigning valency information and selectional restrictions to entries in a bilingual dictionary, based on information in an existing valency dictionary. The method is based on two assumptions: words with similar meaning have similar subcategorization frames and selectional restrictions; and words with the same translations have similar meanings. Based on these assumptions, new valency entries are constructed for words in a plain bilingual dictionary, using entries with similar source-language meaning and the same target-language translations. We evaluate the effects of various measures of similarity.
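The construction step described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the `ValencyEntry` type, the `propose_entries` function, the `threshold` parameter, and the toy English-French data are all hypothetical, and the similarity measure is left as a pluggable function because the abstract only says that several measures are compared.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ValencyEntry:
    # Hypothetical record layout, not taken from the paper.
    source: str        # source-language predicate
    translation: str   # target-language translation
    frame: tuple       # (slot, selectional restriction) pairs


def propose_entries(bilingual, valency, similarity, threshold=0.5):
    """Propose valency entries for bilingual pairs that lack one.

    A pair (source, translation) borrows the frame of the existing entry
    that has the same translation and the most similar source word,
    provided the similarity reaches `threshold` (an assumed cut-off).
    """
    known = {(e.source, e.translation) for e in valency}

    # Index existing valency entries by their target-language translation.
    by_translation = {}
    for entry in valency:
        by_translation.setdefault(entry.translation, []).append(entry)

    proposed = []
    for source, translation in bilingual:
        if (source, translation) in known:
            continue  # already covered by the valency dictionary
        scored = [
            (similarity(source, e.source), e)
            for e in by_translation.get(translation, [])
        ]
        scored = [(s, e) for s, e in scored if s >= threshold]
        if not scored:
            continue  # no sufficiently similar seed entry
        _, best = max(scored, key=lambda pair: pair[0])
        proposed.append(ValencyEntry(source, translation, best.frame))
    return proposed


# Toy data, invented for illustration only.
frame = (("subject", "animate"), ("object", "food"))
valency = [ValencyEntry("eat", "manger", frame)]
bilingual = [("eat", "manger"), ("devour", "manger"), ("read", "lire")]


def toy_similarity(a, b):
    # Stand-in for a real measure (for example, thesaurus distance).
    return 1.0 if {a, b} == {"eat", "devour"} else 0.0


result = propose_entries(bilingual, valency, toy_similarity)
```

In this toy run only "devour" receives a new entry: it shares the translation of "eat" and is judged similar to it, whereas "read" has no seed entry with the same translation. Swapping in a different `similarity` function is the hook for comparing similarity measures.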
Similar resources
Towards Automatic Extraction of Argument Structure from Corpora
The valency of predicates is a key component of a lexical entry because most, if not all, recent syntactic theories 'project' syntactic structure from such information in the lexicon (e.g. Pollard & Sag, 1987). Therefore, a wide-coverage robust parser utilising a grammar based on one of these theories must have access to an accurate dictionary encoding (at a minimum) valency information and prob...
From dictionary to corpus to self-organizing dictionary: learning valency associations in the face of variation and change
ing over specific lexically-governed particles and prepositions and specific predicate selectional preferences, but including some 'derived' / 'alternant' semi-productive, and therefore only semi-predictable, bounded dependency constructions, such as particle or dative movement, there are at least 163 valency frames associated with verbal predicates in (current) English (Briscoe, 2000). In this ...
Verbal Valency in the MT Between Related Languages
The paper analyzes the differences in verbal valency frames between two related Slavic languages, Czech and Russian, with regard to their role in a machine translation system. The valency differences are a frequent source of translation errors. The results presented in the paper show that the number of substantially different valency frames is relatively low and that a bilingual valency diction...
A Method of Adding New Entries to a Valency Dictionary by Exploiting Existing Lexical Resources
Information on subcategorization and selectional restrictions in a valency dictionary is very important for natural language processing in tasks such as monolingual parsing, accurate rule-based machine translation and automatic summarization. However, adding this detailed information is both time consuming and costly. In this paper we present a method of assigning valency information and select...
Evaluation of a Method of Creating New Valency Entries
Information on subcategorization and selectional restrictions is important for natural language processing tasks such as deep parsing, rule-based machine translation and automatic summarization. In this paper we present a method of adding detailed entries to a bilingual dictionary, based on information in an existing valency dictionary. The method is based on two assumptions: words with similar...